Endogenous Post-Stratification in Surveys: Classifying with a Sample-Fitted Model

نویسندگان

  • F. Jay Breidt
  • Jean D. Opsomer
چکیده

Post-stratification is frequently used to improve the precision of survey estimators when categorical auxiliary information is available from sources outside the survey. In natural resource surveys, such information is often obtained from remote sensing data, classified into categories. These stratification categories may be constructed based on models fitted to the sample data (“endogenous post-stratification”), thereby violating the standard post-stratification assumptions that observations are classified without error into post-strata, and post-stratum counts are known. Properties of the endogenous post-stratification estimator are derived for the case of a sample-fitted generalized linear model, from which the post-strata are constructed by dividing the range of the model predictions into predetermined intervals. Consistency and asymptotic normality of the endogenous post-stratification estimator are established, showing that it has the same asymptotic variance as the traditional post-stratified estimator with fixed strata. Simulation experiments demonstrate that the practical effect of first fitting a model to the survey data before post-stratifying is small, even for relatively small sample sizes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

“Equivalent Linear Composition” as an Efficient Stratification Factor in Multipurpose Surveys

Horticulture survey is a multi-purpose survey which is conducted ad hoc by Statistical Center of Iran (SCI). Availability of survey variables in the sampling frame suggests a multivariate stratification in each province based on its desired variables for acquiring a higher efficiency. There are several ways to stratify the sampling frame considering all stratification variables, such as using s...

متن کامل

Multiple Imputation in a Complex Sample Survey

Multiple imputation for missing survey data is relatively new concept. As defined by one of its leading proponents, "multiple imputation is the technique that replaces each missing or deficient value with two or more acceptable values representing a distribution of possibilities" (Rubin 1987, p.2). Multiply-imputed data reflects the uncertainty contained in the imputation process in a way not p...

متن کامل

Post - Stratification and Conditional Variance Estimation Richard Valliant

Post-stratification estimation is a technique used in sample surveys to improve efficiency of estimators. Survey weights are adjusted to force the estimated numbers of units in each of a set of estimation cells to be equal to known population totals. The resulting weights are then used in forming estimates of means or totals of variables collected in the survey. For example, in a household surv...

متن کامل

طبقه‌بندی تعداد فرزندان زنده به دنیا آمده با استفاده از مدل کارت (CART)

Background and Objective: Discriminant analysis and logistic regression are classical methods for classifying data in several studies. However, these models do not lead in valid results due to not meeting all necessary assumptions. The purpose of this study was to classify the number of Children Ever Born (CEB) using decision tree model in order to present an efficient method to classify demogr...

متن کامل

Measuring Local Heterogeneity with 1990 U.S. Census Data

A sample covering 204,394 blocks from the 1990 U.S. Census permits measurement of residual heterogeneity from local area to local area after controlling by stratification for demographic characteristics such as race, ethnicity, age, sex as well as geographic characteristics such as region and place-type. The local areas have populations on the order of 10,000 people. The variables studied are f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002